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Amendments to the Claims 



1. (original) A computerized system for augmenting data from a source 
5 database with data from a reference database to generate an augmented 

database that can be used for predictive modeling, comprising: 
a source database comprising structured data; 
a reference database having reference data; 

a locator component configured to use the structured data to locate 
IO reference data in the reference database suitable for association with the 

source database; 

an analyzer component configured to process the reference data into a set 
of descriptors and associating the descriptors to the source data to form 
an augmented database; 
15 a predictive modeling component configured to classify behavior with the 

augmented database; and 

a. data mining component configured to conduct searches of data in the 
augmented database. 

2. (original) The computerized system of Claim 1 , wherein the source database 
20 contains financial transaction data. 

3. (original) The computerized system of Claim 1 , wherein the source database 
contains telephone call detail records, and wherein the reference database 
contains business indices and telephone directories augmented by public 
information on merchants and service providers. 

25 4. (original) The computerized system of Claim 1 , wherein the source database 
contains investment transactions and the reference database contains public 
information regarding companies, mutual funds and/or other investment interests. 

5. (original) The computerized system of Claim 1 , wherein the source database 
30 contains insurance transactions, and wherein the reference database contains 

information regarding insurance products, claims and/or insurance evaluations. 

2 
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6. (original) The computerized system of Claim 1 , wherein the source database 
contains product inventories, and wherein the reference database contains 
information describing products. 

5 7. (original) The computerized system of Claim 1 , wherein the source database 
contains Internet browser view transactions, and wherein the reference database 
contains the Internet pages of the browser view transactions. 

8. (original) The computerized system of Claim 1 , wherein the source database 
10 contains retail transactions at an individual product level, and wherein the 

reference database contains product information from catalogs. 

9. (original) The computerized system of Claim 2, wherein the structured data 
comprises at least a name or identifier corresponding to a merchant, product 

15 and/or service. 

10. (original) The computerized system of Claim 1, wherein the reference 
database contains data in an unstructured format. 

20 11. (original) The computerized system of Claim 10, wherein the reference 
database comprises a public database such as the Internet. 

12. (original) The computerized system of Claim 11, wherein the locator 
component locates electronic pages on the Internet related to merchant, product 

25 and/or service identified of the structured data in the source database. 

13. (original) The computerized system of Claim 12, wherein the locator 
component includes a spider module that searches for embedded links, 
keywords and/or references in the text found at the located electronic pages. 

30 

14. (original) The computerized system of Claim 12, wherein the locator 
component retrieves the natural language text from the located electronic pages. 



PAGE BffiO * RCVD AT 8W2005 4:45:52 PM [Eastern DayBght Time] * SVR:USPT(«FXRF4/26 * DHlS:2738300 * CSID:650 474 8401 * DURATION (mm-ss):13-20 



08/08/2005 MON 13:48 FAX 650 474 8401 GLENN PATENT GROUP 



1007/060 



15. (original) The computerized system of Claim 14, wherein the processing of 
reference data in the reference database is accomplished by reducing the natural 
language text to a set of weighted keywords. 

5 16. (original) The computerized system of Claim 12, wherein the locator 
component validates the located electronic pages using zip code and/or 
Standard Industry Code (SIC) information stored in the source database. 

17. (original) The computerized system of Claim 1, wherein the predictive 
10 modeling module uses one or more of the following methodologies: model-based 

regression, non-parametric regression (e.g., neural networks), Bayesian 
inference, hidden Markov models, fuzzy logic models, evolutionary models, or 
decision trees. 

15 18. (original) The computerized system of Claim 1 , wherein the source database 
comprises account based transactional records and the analyzer component 
aggregates the data from the source database and its associated reference data 
by reference to an account field. 

20 19. (original) The system as defined in Claim 18, wherein the association of 
unstructured data from the reference database is delivered through a predictive 
statistical model built from known historic outcomes associated with records 
within the source database. 

25 20. (original) A computerized system for augmenting data from a source 
database with data from a reference database to generate an augmented 
database that can be used for predictive modeling, comprising: 

a source database comprising a plurality of transaction data records with 
each transaction data record having at least one field identifying a 
30 merchant, product and/or service; 

a merchant identifier database comprising a plurality of reference 
addresses and value description identifiers for merchants, products and/or 
services; 



4 
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a reference database; 

an address locating module configured to search the reference database 
to locate references for merchants, products and/or services identified in 
the source database; 

5 an account description database; 

a transaction augmentation module, configured to attach the value 
description of a particular merchant, product and/or service to the 
transaction data records and store the resulting combined record in the 
account description database; and 

10 a merchant analysis builder module configured to condense the 

re f erences provided by the address locating module into a value 
description and store the value description in the merchant identifier 
database. 

15 21. (original) The system of Claim 20, further comprising an account descriptor 
builder module configured to generate descriptive account records from the 
merchant identifier database and the source database. 

22. (original) The system of Claim 20, further comprising a lexicographic 
20 database configured to index value description identifiers to keywords. 

23. (original) The system of Claim 20, wherein the reference database comprises 
the Internet. 

25 24. (original) The system of Claim 20, further comprising a predictive modeling 
module configured to predict future behavior of accounts, merchants, or other 
entities, using data from the account description database. 

25. (original) The system of Claim 20, further comprising a data mining search 
30 engine configured to conduct keyword searches of the account description 

database to identify accounts, merchants, or products. 
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26. (original) A computerized method of augmenting data from a source 
database with data from a reference database, the method comprising: 

retrieving at least one data record recording an event from the source 
database; 

5 identifying a field in the data record that specifies an entity; 

locating reference data from the reference database that describes the 
entity specified by the entity field; 

processing the reference data to form a set of keyword descriptors 
describing the entity; 

10 augmenting the data record with the keyword descriptors to generate an 

augmented data record describing the entity; 

building an account descriptor database that includes at least one data 
record that correlates the at least one event with the description of the 
entity from the augmented data record; and 
15 searching the account descriptor database for selected data records that 

meet a desired criteria. 

27. (original) The method of Claim 26, wherein the locating reference data 
includes locating data in an unstructured database. 



20 



28. (original) The method of Claim 26, wherein the reference database includes 
at least a portion of the Internet. 



29. (original) The method of Claim 28, wherein the locating reference data 
25 includes locating electronic pages using the entity specified in the at least one 

data record. 

30. (original) The method of Claim 29, wherein locating reference data further 
spidering for additional electronic pages cited within the located electronic pages. 



30 



31. (original) The method of Claim 26, wherein locating reference data includes 
reducing natural language text to keyword descriptors. 
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32. (original) The method of Claim 26, further comprising validating the located 
reference data using data from the at least one data record. 

33. (original) The method of Claim 26, further comprising storing the augmented 
5 data record in a merchant database. 

34. (original) A method of augmenting structured data stored in a source 
database with unstructured data stored In a reference database, comprising: 

reading a data record from the source database; 
10 searching the reference database for information describing the data 

record; 

condensing the information describing the data record into at least one 
keyword description; 

augmenting the data record with the keyword description. 

15 

35. (original) The method of Claim 34, wherein the reference database 
comprises the Internet. 

36. (original) The method of Claim 35, wherein the data record contains at least 
20 a merchant name or identifier. 

37. (original) The method of Claim 36, wherein searching the reference database 
further includes locating electronic pages related to the merchant identified in the 
data record. 

25 

38. (original) The method of Claim 37, wherein searching the reference database 
further includes retrieving the natural language text from the located electronic 
pages. 

30 39. (original) The method of Claim 38, wherein condensing the information 
comprises reducing the natural language text to at least one weighted keyword. 
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40. (original) A computerized system associating unstructured data in a 
reference database with structured data in a source database, comprising: 
means for reading a data record from the source database; 
means for searching the reference database for information describing the 
5 data record; 

means for condensing the information describing the data record into at 
least one keyword description; and 

means for augmenting the data record with the keyword description. 

10 41. (original) The computerized system of Claim 40, wherein the reference 
database comprises the Internet 

42. (original) The computerized system of Claim 41, wherein the data record 
contains at least a merchant name or identifier. 

15 

43. (original) The computerized system of Claim 42, wherein the means for 
searching the reference database includes means for locating electronic pages 
related to the merchant identified in the data record. 

20 44. (withdrawn) 

45. (withdrawn) 

46. (previously amended) A computerized system for augmenting data from a 
25 source database with data from a reference database to generate an augmented 

database that can be used for predictive modeling, comprising: 
a source database comprising structured data; 
a reference database having reference data; 

a locator component configured to use the structured data to locate 
30 reference data in the reference database suitable for association with the 

source database; and 
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an analyzer component configured to process the reference data into a set 
of descriptors and associating the descriptors to the source data to form 
an augmented database^;]]. 

5 47. (Currently Amended) The computerized system of Claim 46, further 
including a predictive modeling component configured to classify behavior with 
the augmented database^f-and 

48. (original) The computerized system of Claim 46, further including a data 
10 mining component configured to conduct searches of data in the augmented 

database. 

49. (original) The computerized system of Claim 46, wherein the source 
database contains financial transaction data. 

15 

50. (original) The computerized system of Claim 47, wherein the structured data 
comprises at least a name or identifier corresponding to a merchant, product 
and/or service. 

20 51. (original) The computerized system of Claim 46, wherein the reference 
database contains data in an unstructured format. 

52. (original) The computerized system of Claim 51, wherein the reference 
database comprises a public database such as the Internet 

25 

53. (original) The computerized system of Claim 52, wherein the locator 
component locates electronic pages on the Internet related to merchant, product 
and/or service identified of the structured data in the source database. 

30 54. (original) The computerized system of Claim 53, wherein the locator 
component includes a spider module that searches for embedded links, 
keywords and/or references in the text found at the located electronic pages. 
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55. (original) The computerized system of Claim 53, wherein the locator 
component retrieves the natural language text from the located electronic pages. 

56. (original) The computerized system of Claim. 55, wherein the processing of 
5 reference data in the reference database is accomplished by reducing the natural 

language text to a set of weighted keywords. 

57. (original) The computerized system of Claim 53, wherein the locator 
component validates the located electronic pages using zip code and/or 

10 Standard Industry Code (SIC) information stored in the source database. 

58. (original) The computerized system of Claim 46, wherein the predictive 
modeling module uses one or more of the following methodologies: model-based 
regression, non-parametric regression (e.g., neural networks), Bayesian 

15 inference, hidden Markov models, fuzzy logic models, evolutionary models, or 

decision trees. 

59. (original) The computerized system of Claim 46, wherein the source 
database comprises account based transactional records and the analyzer 

20 component aggregates the data from the source database and its associated 

reference data by reference to an account field. 

60. (original) The system as defined in Claim 59, wherein the association of 
unstructured data from the reference database is delivered through a predictive 

25 statistical model built from known historic outcomes associated with records 

within the source database. 
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